智能论文笔记

PARSE challenge 2022: Pulmonary Arteries Segmentation using Swin U-Net Transformer(Swin UNETR) and U-Net

Akansh Maurya , Kunal Dashrath Patil , Rohan Padhy , Kalluri Ramakrishna , Ganapathy Krishnamurthi

分类：计算机视觉

2022-08-20

在这项工作中，我们介绍了我们提出的方法，该方法是使用SWIN UNETR和基于U-NET的深神经网络体系结构从CT扫描中分割肺动脉的方法。六个型号，基于SWIN UNETR的三个型号以及基于3D U-NET的三个模型，使用加权平均值来制作最终的分割掩码。我们的团队通过这种方法获得了84.36％的多级骰子得分。我们的工作代码可在以下链接上提供：https：//github.com/akansh12/parse2022。这项工作是Miccai Parse 2022挑战的一部分。

translated by 谷歌翻译

Inferring Class Label Distribution of Training Data from Classifiers: An Accuracy-Augmented Meta-Classifier Attack

Raksha Ramakrishna , György Dán

分类：机器学习 | 人工智能

2022-11-08

Property inference attacks against machine learning (ML) models aim to infer properties of the training data that are unrelated to the primary task of the model, and have so far been formulated as binary decision problems, i.e., whether or not the training data have a certain property. However, in industrial and healthcare applications, the proportion of labels in the training data is quite often also considered sensitive information. In this paper we introduce a new type of property inference attack that unlike binary decision problems in literature, aim at inferring the class label distribution of the training data from parameters of ML classifier models. We propose a method based on \emph{shadow training} and a \emph{meta-classifier} trained on the parameters of the shadow classifiers augmented with the accuracy of the classifiers on auxiliary data. We evaluate the proposed approach for ML classifiers with fully connected neural network architectures. We find that the proposed \emph{meta-classifier} attack provides a maximum relative improvement of $52\%$ over state of the art.

translated by 谷歌翻译

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

Federico Bianchi , Pratyusha Kalluri , Esin Durmus , Faisal Ladhak , Myra Cheng , Debora Nozza , Tatsunori Hashimoto , Dan Jurafsky , James Zou , Aylin Caliskan

分类：自然语言处理 | 计算机视觉

2022-11-07

Machine learning models are now able to convert user-written text descriptions into naturalistic images. These models are available to anyone online and are being used to generate millions of images a day. We investigate these models and find that they amplify dangerous and complex stereotypes. Moreover, we find that the amplified stereotypes are difficult to predict and not easily mitigated by users or model owners. The extent to which these image-generation models perpetuate and amplify stereotypes and their mass deployment is cause for serious concern.

translated by 谷歌翻译

Cluster-to-adapt: Few Shot Domain Adaptation for Semantic Segmentation across Disjoint Labels

Tarun Kalluri , Manmohan Chandraker

分类：计算机视觉 | 机器学习

2022-08-04

跨数据集的语义细分的域适应性，由相同类别组成，已经获得了一些最近的成功。但是，更一般的情况是源和目标数据集对应于非重叠标签空间时。例如，分割数据集中的类别根据环境或应用程序的类型发生了很大变化，但共享许多有价值的语义关系。基于特征对齐或差异最小化的现有方法不会考虑此类类别的转移。在这项工作中，我们提出了群集到适应（C2A），这是一种基于计算有效的聚类方法，用于跨分割数据集的域适应性，这些方法完全不同但可能相关类别。我们表明，在变换的特征空间中强制执行的这种聚类目标可以自动选择跨源和目标域的类别，这些类别可以对齐以改善目标性能，同时防止对无关类别的负转移。我们通过实验对室外的挑战性问题进行了实验，以少量拍摄和零拍设置来证明室内适应性的挑战性问题，在所有情况下，性能对现有方法和基准的绩效持续改善。

translated by 谷歌翻译

MemSAC: Memory Augmented Sample Consistency for Large Scale Domain Adaptation

Tarun Kalluri , Astuti Sharma , Manmohan Chandraker

分类：计算机视觉 | 人工智能 | 机器学习

2022-07-25

实用的现实世界数据集具有丰富的类别，为无监督的领域适应带来了新的挑战，例如小型阶层歧视性，仅依靠域不变性的现有方法不能很好地处理。在这项工作中，我们提出了MEMSAC，该MEMSAC利用了跨源和目标域的样本级别相似性，以实现判别性转移，以及扩展到大量类别的体系结构。为此，我们首先引入一种内存增强方法，以在标记的源和未标记的目标域实例之间有效提取成对的相似性关系，该实例适用于处理任意数量的类。接下来，我们建议和理论上证明对比损失的新型变体，以促进阶层内跨域样本之间的局部一致性，同时在类别之间执行分离，从而保留从源到目标的歧视性转移。我们验证了MEMSAC的优势，比以前的最先进的最先进的转移任务有了显着改进。我们还提供了深入的分析和对MEMSAC有效性的见解。

translated by 谷歌翻译

On the Opportunities and Risks of Foundation Models

Rishi Bommasani , Drew A. Hudson , Ehsan Adeli , Russ Altman , Simran Arora , Sydney von Arx , Michael S. Bernstein , Jeannette Bohg , Antoine Bosselut , Emma Brunskill

分类：机器学习 | 人工智能

2021-08-16

AI正在经历范式转变，随着模型的兴起（例如Bert，Dall-E，GPT-3），这些模型经过大规模的数据训练，并且可以适应广泛的下游任务。我们称这些模型基础模型来强调其至关重要但不完整的特征。该报告提供了基础模型的机会和风险的详尽说明，包括其功能（例如语言，愿景，机器人技术，推理，人类互动）和技术原则（例如，模型架构，培训程序，数据，系统，安全，安全性，评估，理论）对其应用（例如法律，医疗保健，教育）和社会影响（例如不平等，滥用，经济和环境影响，法律和道德考虑）。尽管基础模型基于标准的深度学习和转移学习，但它们的规模导致了新的新兴能力，以及它们在许多任务中的有效性都激发了同质化。同质化提供了强大的杠杆作用，但要求谨慎，因为基础模型的缺陷均由下游的所有适应模型继承。尽管即将广泛地部署基础模型，但我们目前对它们的工作方式，失败以及由于其新兴属性的影响而缺乏清晰的了解。为了解决这些问题，我们认为基础模型的许多批判性研究都需要与他们的基本社会技术性质相称。

translated by 谷歌翻译

The Values Encoded in Machine Learning Research

Abeba Birhane , Pratyusha Kalluri , Dallas Card , William Agnew , Ravit Dotan , Michelle Bao

分类：机器学习 | 人工智能

2021-06-29

机器学习目前对世界产生了巨大的影响，越来越多地影响机构实践并影响了社区。因此，至关重要的是，我们质疑该领域的模糊概念是价值中性或普遍有益的，并研究该领域正在发展的特定价值。在本文中，我们首先介绍了一种研究文档中编码的值的方法和注释方案，例如研究论文。采用该方案，我们分析了100个高度引用的机器学习论文，该论文在Premier机器学习会议，ICML和Neurips上发表。我们注释论文的关键特征，这些特征揭示了其价值观：他们选择项目的理由，这些项目的归因于他们提升的项目，对潜在的负面后果的考虑以及机构的隶属关系和资金来源。我们发现，很少有论文证明其项目如何与社会需求联系起来（15 \％），而讨论负潜力（1 \％）的讨论更少。通过逐行的内容分析，我们确定了59个在ML研究中得到提升的值，其中，我们发现论文最常根据绩效，概括，定量证据，效率，基于过去的绩效，定量证据，效率来证明和评估自己的合理性和评估工作和新颖。我们提供了广泛的文本证据，并在这些价值观的定义和操作中确定了关键主题。值得注意的是，我们发现系统的文本证据表明，这些最高价值是通过假设和含义来定义和应用的，通常支持权力的集中化。在本文中，我们发现这些高度引用的论文与科技公司和精英大学之间的关系越来越紧密。

translated by 谷歌翻译

A framework for deep learning emulation of numerical models with a case study in satellite remote sensing

Kate Duffy , Thomas Vandal , Weile Wang , Ramakrishna Nemani , Auroop R. Ganguly

分类：机器学习 | (统计)机器学习

2019-10-29

基于物理学的数值模型代表了地球系统建模中的最先进，包括我们的最佳工具，用于产生洞察和预测。尽管计算能力快速增长，但对更高模型分辨率的感知需求压倒了最新一代电脑，降低了建模者为理解参数敏感性和表征变异性和不确定性而产生模拟的能力。因此，通常开发了代理模型以捕获全吹制数值的基本属性。最近的机器学习方法的成功，尤其是深度学习，跨越许多学科提供了复杂的非线性连接者表示可能能够捕获地球系统中的底层复杂结构和非线性过程的可能性。基于深度学习的仿真的难度测试，这是指数值模型的近似，是为了了解它们是否可以在计算效率方面与传统形式的代理模型相当，同时再现模型以可靠的方式再现模型。可以预期通过该测试的深度学习仿真，而不是捕获复杂进程和时空依赖性的简单模型来表现更好。在这里，我们检查了基于卫星的遥感的案例研究，深度学习方法可以可靠地代表来自代理模型的模拟，具有可比的计算效率。我们的结果令人鼓舞的是，深度学习仿真以可接受的准确性再现结果，并且往往更快的性能。我们阐明了我们对深度学习的高性能实现的改进步伐的更广泛的影响以及地球科学中更高分辨率模拟的渴望。

translated by 谷歌翻译

Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization

Ramprasaath R. Selvaraju , Michael Cogswell , Abhishek Das , Ramakrishna Vedantam , Devi Parikh , Dhruv Batra

分类：

2016-10-07

We propose a technique for producing 'visual explanations' for decisions from a large class of Convolutional Neural Network (CNN)-based models, making them more transparent and explainable.Our approach -Gradient-weighted Class Activation Mapping (Grad-CAM), uses the gradients of any target concept (say 'dog' in a classification network or a sequence of words in captioning network) flowing into the final convolutional layer to produce a coarse localization map highlighting the important regions in the image for predicting the concept.Unlike previous approaches, Grad-CAM is applicable to a wide variety of CNN model-families: (1) CNNs with fullyconnected layers (e.g. VGG), (2) CNNs used for structured outputs (e.g. captioning), (3) CNNs used in tasks with multimodal inputs (e.g. visual question answering) or reinforcement learning, all without architectural changes or re-training. We combine Grad-CAM with existing fine-grained visualizations to create a high-resolution class-discriminative vi-

translated by 谷歌翻译

Convolutional Pose Machines

Shih-En Wei , Varun Ramakrishna , Takeo Kanade , Yaser Sheikh

分类：

2016-01-30

Pose Machines provide a sequential prediction framework for learning rich implicit spatial models. In this work we show a systematic design for how convolutional networks can be incorporated into the pose machine framework for learning image features and image-dependent spatial models for the task of pose estimation. The contribution of this paper is to implicitly model long-range dependencies between variables in structured prediction tasks such as articulated pose estimation. We achieve this by designing a sequential architecture composed of convolutional networks that directly operate on belief maps from previous stages, producing increasingly refined estimates for part locations, without the need for explicit graphical model-style inference. Our approach addresses the characteristic difficulty of vanishing gradients during training by providing a natural learning objective function that enforces intermediate supervision, thereby replenishing back-propagated gradients and conditioning the learning procedure. We demonstrate state-of-the-art performance and outperform competing methods on standard benchmarks including the MPII, LSP, and FLIC datasets.

translated by 谷歌翻译